AITopics | root word

Collaborating Authors

root word

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Investigating Antigram Behaviour using Distributional Semantics

Sengupta, Saptarshi

arXiv.org Artificial IntelligenceOct-25-2023

The field of computational linguistics constantly presents new challenges and topics for research. Whether it be analyzing word usage changes over time or identifying relationships between pairs of seemingly unrelated words. To this point, we identify Anagrams and Antigrams as words possessing such unique properties. The presented work is an exploration into generating anagrams from a given word and determining whether there exists antigram (semantically opposite anagrams) relationships between the pairs of generated anagrams using GloVe embeddings. We propose a rudimentary, yet interpretable, rule-based algorithm for detecting antigrams. On a small dataset of just 12 antigrams, our approach yielded an accuracy of 39\% which shows that there is much work left to be done in this space.

anagram, antigram, similarity score, (15 more...)

arXiv.org Artificial Intelligence

1901.05066

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
North America > United States > Pennsylvania (0.04)
North America > United States > Arizona (0.04)

Genre: Research Report (0.40)

Technology:

Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Lexicon and Rule-based Word Lemmatization Approach for the Somali Language

Mohamed, Shafie Abdi, Mohamed, Muhidin Abdullahi

arXiv.org Artificial IntelligenceAug-3-2023

The lemmatization summary statistics of the Example 3 sentence are also provided in Table 1. In this case, the percentage of words that were normalized for the example reached 100%, which means that all content words (excluding stop words and special characters) are lemmatized. This may be due to the fact that this is a short document, a sentence of 8 words. Unlike the lemmatization statistics of this example, a proportion of words in any typical text document (i.e., longer than a sentence) will normally remain unresolved - words that the algorithm fails to lemmatize in both stages. Overall and as part of evaluating the proposed method, we have tested the algorithm on 120 documents of various lengths including general news articles, and social media posts. For the news articles, we have used extracts (i.e., title and first 1-2 paragraphs) as well as the full articles to see the effect of document length. The results we found for these different document categories are summarized in Table 2. The notations #Docs, Avg Doc Len, and Avg Acc. in the table respectively represent the number of documents, average document length in words, and average lemmatization accuracy. As shown, the results demonstrate that the algorithm achieves a relatively good accuracy of 57% for moderately long documents (e.g.

lemmatization, lexicon, root word, (14 more...)

arXiv.org Artificial Intelligence

2308.01785

Country:

North America > United States > Washington > King County > Seattle (0.04)
Europe > United Kingdom > England > West Midlands > Birmingham (0.04)
Africa > Middle East > Somalia > Banaadir > Mogadishu (0.04)
(3 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Representation & Reasoning > Rule-Based Reasoning (1.00)
Information Technology > Artificial Intelligence > Natural Language (1.00)
Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Semantic Tokenizer for Enhanced Natural Language Processing

Mehta, Sandeep, Shah, Darpan, Kulkarni, Ravindra, Caragea, Cornelia

arXiv.org Artificial IntelligenceApr-24-2023

Traditionally, NLP performance improvement has been focused on improving models and increasing the number of model parameters. NLP vocabulary construction has remained focused on maximizing the number of words represented through subword regularization. We present a novel tokenizer that uses semantics to drive vocabulary construction. The tokenizer includes a trainer that uses stemming to enhance subword formation. Further optimizations and adaptations are implemented to minimize the number of words that cannot be encoded. The encoder is updated to integrate with the trainer. The tokenizer is implemented as a drop-in replacement for the SentencePiece tokenizer. The new tokenizer more than doubles the number of wordforms represented in the vocabulary. The enhanced vocabulary significantly improves NLP model convergence, and improves quality of word and sentence embeddings. Our experimental results show top performance on two Glue tasks using BERT-base, improving on models more than 50X in size.

artificial intelligence, machine learning, natural language, (18 more...)

arXiv.org Artificial Intelligence

2304.12404

Country:

North America > United States > Minnesota > Hennepin County > Minneapolis (0.14)
Oceania > Australia > Victoria > Melbourne (0.04)
North America > United States > Illinois > Cook County > Chicago (0.04)
(5 more...)

Genre: Research Report > New Finding (0.34)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Neural Networks (0.70)
Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.69)

Add feedback

Working with Text -Part 4. Techniques in handling text data

#artificialintelligenceDec-15-2022, 05:40:14 GMT

Example: 'I want to read a book' In the above example there are 6 tokens which are- ('I', 'want, 'to', 'read', 'a' and'book') A type is the class of all tokens containing the same character sequence. In the above example, there are only 5 types which are - 'can, 'you', 'a, 'as' and'canner' as'can', 'as' and'a' are being repeated. In the above example, by deleting period and hyphens between the characters and words we are normalizing the type by making it a term. So the term in the above example is: 'USA' and'antiinflammatory' Example: "Hello everyone.Welcome to the course." The tokens for the given sentence will be -- ['Hello','everyone', 'Welcome', 'to', 'the', 'course'] Welcome to the Natural Language Processing course.

above example, information retrieval, natural language, (16 more...)

#artificialintelligence

Country: North America > United States (0.41)

Genre: Instructional Material > Course Syllabus & Notes (0.47)

Technology: Information Technology > Artificial Intelligence > Natural Language > Information Retrieval (0.35)

Add feedback

IMPORTANT TEXT PRE-PROCESSING TECHNIQUES FOR NLP

#artificialintelligenceJul-26-2022, 13:30:18 GMT

Natural Language Processing (NLP) helps us to communicate or talk with a computer just like we talk to a human. NLP can also be defined as the intersection of Artificial Intelligence (AI), Linguistics and Computer Science, that helps the machine or computer to understand, interpret and manipulate human language. There are two main parts to NLP: 1. Data Preprocessing 2. Algorithm development Here, in this blog we'll be only looking about the first and most important process, "data preprocessing". Data preprocessing is the most essential step for any Machine Learning model. It plays a major role in deciding the performance of the model.

important text pre-processing technique, nlp, paragraph, (13 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.51)

Add feedback

NLP Tutorials Part -I from Basics to Advance - Analytics Vidhya

#artificialintelligenceJan-15-2022, 07:10:48 GMT

All of the topics will be explained using codes of python and popular deep learning and machine learning frameworks, such as sci-kit learn, Keras, and TensorFlow. Natural Language Processing is a part of computer science that allows computers to understand language naturally, as a person does. This means the laptop will comprehend sentiments, speech, answer questions, text summarization, etc. We will not be much talking about its history and evolution. If you are interested, prefer this link.

exploratory data analysis, library, text data, (14 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.50)

Add feedback

Lemmatization In Natural Language Processing -- NLP

#artificialintelligenceJan-3-2022, 11:40:37 GMT

In my previous article I discussed about'Stemming' a process where a given word is chopped off to its root word. If you haven't red my previous article on'Stemming' I insist you to read it before moving any further on this article. Unlike stemming which chop off the given word to its root word'Lemmatization' is a almost similar but it always return you the chopped word which has some dictionary meaning. But lemmatization do care if the word it is returning has meaning or no. A word that is returned by lemmatization can also be called a'lemma'.

lemmatization, natural language processing, root word, (2 more...)

#artificialintelligence

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Text Processing: A Step by Step Guide through Twitter Sentimental Analysis - YOUR DATA GUY

#artificialintelligenceDec-7-2021, 15:50:24 GMT

According to Taweh Beysolow, "Natural Language Processing (NLP) is a subfield of computer science that is focused on allowing computers to understand language in a'natural' way, as humans do." NLP has evolved so rapidly gaining traction in its applications inn artificial intelligence (AI). In this project, we will explore one of the most exciting NLP applications i.e. We will build a machine learning model that can categorize tweets as positive (pro-vaccine), negative (anti-vaccine) or neutral. Stay tuned and let's jump into the project.

dataset, text column, text processing, (15 more...)

#artificialintelligence

Genre:

Workflow (0.50)
Instructional Material > Training Manual (0.40)

Industry:

Health & Medicine > Therapeutic Area > Immunology (0.93)
Health & Medicine > Therapeutic Area > Vaccines (0.74)

Technology: Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.93)

Add feedback

A Complete Guideline to Natural Language Processing (NLP)

#artificialintelligenceOct-20-2021, 02:11:11 GMT

"Language is the road map of a culture. It tells you where its people come from and where they are going" -- Rita Mae Brown I would like to share my real-life experience. Back in 2016, I got myself admitted into a renowned engineering university of Bangladesh aiming to be a computer science graduate. At the very onset of my 4th semester, I came to know about the buzzword machine learning. And immediately I got involved in learning Machine Learning and felt longing to learn about the techniques. I started to study from the very basics of ML algorithms.

information, natural language processing, nlp, (11 more...)

#artificialintelligence

Country:

Asia > Bangladesh (0.25)
North America > United States > New York (0.05)
North America > United States > California (0.05)

Technology: Information Technology > Artificial Intelligence > Natural Language (1.00)

Add feedback

Introduction to Natural Language Processing for Machine Learning

#artificialintelligenceAug-16-2021, 19:00:15 GMT

There is a lot of text present around us. We see it in books, articles, comments, and newspapers. It would be really wise to use this text and convert it into a form that could be easily understood by machine learning and deep learning algorithms. As a result, they would take the processed text and give predictions for different use cases. Natural language processing (NLP) refers to converting natural text into a form that could be used for machine learning purposes.

information, natural language processing, prediction, (11 more...)

#artificialintelligence

Technology:

Information Technology > Artificial Intelligence > Natural Language > Text Processing (0.98)
Information Technology > Artificial Intelligence > Machine Learning > Neural Networks > Deep Learning (0.72)

Add feedback